
RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:51 



INPUT SET; S33828.raw 



This Raw Listing contains the General 
Information Section and up to the first 5 pages. 



(ii) TITLE OF THE INVENTION: ALPHA- GALACTOS IDASE 

(iii) NUMBER OF SEQUENCES: 4 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Fish & Richardson, P.C. 

(B) STREET: 4225 Executive Square, Suite 1400 

(C) CITY: La Jolla 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 92037 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: Windows95 

(D) SOFTWARE: FastSEQ for Windows Version 2.0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/407,806 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/613,22 0 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Haile, Ph.D., Lisa A. 

(B) REGISTRATION NUMBER: 38,347 

(C) REFERENCE/DOCKET NUMBER: 09010/004001 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-678-5070 

(B) TELEFAX: 619-68-5099 

(C) TELEX: 



SEQUENCE LISTING 



General Information 



(i) APPLICANT: Murphy, Dennis 
Reid, John 




(2) INFORMATION FOR SEQ ID NO:l: 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:51 



INPUT SET: S33828.raw 

47 (i) SEQUENCE CHARACTERISTICS: 

48 (A) LENGTH: 52 base pairs 

49 (B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

51 (D) TOPOLOGY: linear 
52 

53 (ii) MOLECULE TYPE: cDNA 
54 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 
56 

57 CCGAGAATTC ATTAAAGAGG AGAAATTAAC TATGAGAGCG CTCGTCTTTC AC 52 
58 

59 (2) INFORMATION FOR SEQ ID NO : 2 : 
60 

61 (i) SEQUENCE CHARACTERISTICS: 

62 (A) LENGTH: 31 base pairs 

63 (B) TYPE: nucleic acid 

64 (C) STRANDEDNESS: single 

65 (D) TOPOLOGY: linear 
66 

67 (ii) MOLECULE TYPE: cDNA 
68 

69 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
70 

71 CGGAAGATCT AGGTTCCCCA TTTTCACCCC T 31 
72 

73 (2) INFORMATION FOR SEQ ID NO : 3 : 
74 

75 (i) SEQUENCE CHARACTERISTICS: 

76 (A) LENGTH: 1041 base pairs 

77 (B) TYPE: nucleic acid 

78 (C) STRANDEDNESS: single 

79 (D) TOPOLOGY: linear 
80 

81 (ix) FEATURE: 

82 (A) NAME/KEY: Coding Sequence 

83 (B) LOCATION: 1...1038 

84 (D) OTHER INFORMATION: 
85 

86 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
87 

88 TTG AGA GCG CTC GTC TTT CAC GGC AAC CTC CAG TAT GCC GAA ATC CCA 48 

89 Leu Arg Ala Leu Val Phe His Gly Asn Leu Gin Tyr Ala Glu lie Pro 

90 1 5 10 15 
91 

92 AAG AGC GAA CCA AAG GTC ATA GAG AAG GCA TAC ATC CCA GTC ATC GAG 96 

93 Lys Ser Glu Pro Lys Val lie Glu Lys Ala Tyr lie Pro Val lie Glu 

94 20 25 30 
95 

96 ACA CTG ATT AAA GAA GAA CCT TTT GGG CTC AAC ATA ACG GGC TAT ACC 144 

97 Thr Leu lie Lys Glu Glu Pro Phe Gly Leu Asn lie Thr Gly Tyr Thr 

98 35 40 45 
99 



PAGE: 3 RAW SEQUENCE LISTING DATE: 11/03/1999 

PATENT APPLICATION US/09/407,806 TIME: 13:58:52 

INPUT SET: S33828.raw 

100 TTA AAG TTC CTC CCG AAG GAT ATT ATA CTC GTT AAA GGG GGC ATC GCG 192 

101 Leu Lys Phe Leu Pro Lys Asp lie lie Leu Val Lys Gly Gly lie Ala 

102 50 55 60 
103 

104 AGT GAC CTG ATA GAG ATA ATC GGA ACG AGC TAC ACG GCA ATA CTC CCC 240 

105 Ser Asp Leu lie Glu lie lie Gly Thr Ser Tyr Thr Ala lie Leu Pro 

106 65 70 75 80 
107 

108 CTC CTG CCG CTT AGC AGA GTA GAA GCA CAA GTT CAG AGA GAT AGG GTT 2 88 

109 Leu Leu Pro Leu Ser Arg Val Glu Ala Gin Val Gin Arg Asp Arg Val 

110 85 90 95 
111 

112 AAG GAA GAG CTC TTC GAG GTT TCT CCA AAG GGA TTC TGG CTG CCA GAG 3 36 

113 Lys Glu Glu Leu Phe Glu Val Ser Pro Lys Gly Phe Trp Leu Pro Glu 

114 100 105 110 
115 

116 CTC GCC GAC CCG ATA ATC CCT GCC ATA CTG AAG GAC AAC GGT TAT GAG 384 

117 Leu Ala Asp Pro lie lie Pro Ala lie Leu Lys Asp Asn Gly Tyr Glu 

118 115 120 125 
119 

120 TAT CTA TTC GCC GAC GAG GCG ATG CTT TTC TCA GCT CAT CTC AAC TCG 432 

121 Tyr Leu Phe Ala Asp Glu Ala Met Leu Phe Ser Ala His Leu Asn Ser 

122 130 135 140 
123 

124 GCG ATA AAG CCA ATT AAA CCG CTC CCA CAC CTT ATA AAG GCC CAA AGG 480 

125 Ala lie Lys Pro lie Lys Pro Leu Pro His Leu lie Lys Ala Gin Arg 

126 145 150 155 160 
127 

128 GAA AAG CGC TTT AGG TAC ATC AGC TAT CTC CTT CTC AGG GAG CTT AGG 528 

12 9 Glu Lys Arg Phe Arg Tyr lie Ser Tyr Leu Leu Leu Arg Glu Leu Arg 
130 165 170 175 

131 

132 AAG GCG ATA AAG CTC GTT TTT GAA GGT AAG GTA ACG CTA AAG GTC AAA 576 

133 Lys Ala lie Lys Leu Val Phe Glu Gly Lys Val Thr Leu Lys Val Lys 

134 180 185 190 
135 

13 6 GAC ATC GAA GCC GTA CCC GTT TGG GTG GCC GTG AAC ACG GCT GTA ATG 624 

137 Asp lie Glu Ala Val Pro Val Trp Val Ala Val Asn Thr Ala Val Met 

138 195 200 205 
139 

140 CTC ATC GGA AGG CTT CCT CTT ATG AAT CCT AAG AAA GTG GCG AGC TGG 672 

141 Leu lie Gly Arg Leu Pro Leu Met Asn Pro Lys Lys Val Ala Ser Trp 

142 210 215 220 
143 

144 ATA GAG GAC AAG AAC ATT CTT CTA TAC GGC ACC GAT ATA GAG TTC ATT 720 

145 lie Glu Asp Lys Asn lie Leu Leu Tyr Gly Thr Asp lie Glu Phe He 

146 225 230 235 240 
147 

148 GGC TAT AGG GAC ATT GCA GGC AGA ATG AGT GTT GAG GGA TTA TTA GAG 768 

149 Gly Tyr Arg Asp He Ala Gly Arg Met Ser Val Glu Gly Leu Leu Glu 

150 245 250 255 
151 

152 GTT ATA GAC GAG CTC AAC TCG GAA CTG TGC CCC TCA GAG CTG AAG CAC 816 
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176 
177 
178 
179 
180 
181 
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RAW SEQUENCE LISTING DATE: 11/03/1999 

PATENT APPLICATION US/09/407,806 TIME: 13:58:52 

INPUT SET: S33828.raw 
Val lie Asp Glu Leu Asn Ser Glu Leu Cys Pro Ser Glu Leu Lys His 
260 265 270 

AGT GGA AGG GAG CTC TAC TTA CGG ACT TCG AGT TGG GCA GAT AAG AGC 864 
Ser Gly Arg Glu Leu Tyr Leu Arg Thr Ser Ser Trp Ala Asp Lys Ser 
275 280 285 

TTG AGG ATA TGG AGA GAG GAC GAA GGG AAC GCA AGA CTT AAT ATG CTG 912 
Leu Arg lie Trp Arg Glu Asp Glu Gly Asn Ala Arg Leu Asn Met Leu 
290 295 300 

TAC AAT ATG AGG GGC GAA CTC GCC TTT TTA GCC GAG AAC AGC GAT GCA 960 
Tyr Asn Met Arg Gly Glu Leu Ala Phe Leu Ala Glu Asn Ser Asp Ala 
305 310 315 320 

AGG GGA TGG CCC CTC CCT GAG AGG AGG CTG GAT GCC TTC CGG GCG ATA 1008 
Arg Gly Trp Pro Leu Pro Glu Arg Arg Leu Asp Ala Phe Arg Ala lie 
325 330 335 

TAT AAC GAT TGG AGG GGT AAT GGG GAA CCT TAG 1041 
Tyr Asn Asp Trp Arg Gly Asn Gly Glu Pro 
340 345 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 346 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 



Leu 


Arg 


Ala 


Leu 


Val 


Phe 


His 


Gly 


Asn 


Leu 


Gin 


Tyr Ala 


Glu 


He 


Pro 


l 








5 










10 








15 




Lys 


Ser 


Glu 


Pro 
20 


Lys 


Val 


He 


Glu 


Lys 
25 


Ala 


Tyr 


He Pro 


Val 
30 


He 


Glu 


Thr 


Leu 


He 
35 


Lys 


Glu 


Glu 


Pro 


Phe 
40 


Gly 


Leu 


Asn 


lie Thr 
45 


Gly 


Tyr 


Thr 


Leu 


Lys 
50 


Phe 


Leu 


Pro 


Lys 


Asp 
55 


He 


He 


Leu 


Val 


Lys Gly 
60 


Gly 


He 


Ala 


Ser 


Asp 


Leu 


He 


Glu 


He 


He 


Gly 


Thr 


Ser 


Tyr 


Thr Ala 


He 


Leu 


Pro 


65 










70 










75 








80 


Leu 


Leu 


Pro 


Leu 


Ser 
85 


Arg 


Val 


Glu 


Ala 


Gin 
90 


Val 


Gin Arg 


Asp 


Arg 
95 


Val 


Lys 


Glu 


Glu 


Leu 
100 


Phe 


Glu 


Val 


Ser 


Pro 
105 


Lys 


Gly 


Phe Trp 


Leu 
110 


Pro 


Glu 


Leu 


Ala 


Asp 
115 


Pro 


He 


He 


Pro 


Ala 
120 


He 


Leu 


Lys 


Asp Asn 
125 


Gly 


Tyr 


Glu 
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RAW SEQUENCE LISTING 

PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:53 



206 


Tvr 


Leu 


Phe 


Ala 


Asp 


Glu 


Ala 


Met 


207 




130 










135 




208 


Ala 


He 


Lys 


Pro 


He 


Lys 


Pro 


Leu 


209 


145 










150 






210 


Glu 


Lys 


Arg 


Phe 


Arg 


Tyr 


He 


Ser 


211 










165 








212 


Lys 


Ala 


He 


Lys 


Leu 


Val 


Phe 


Glu 


213 








180 










214 


Asp 


He 


Glu 


Ala 


Val 


Pro 


Val 


Trp 


215 






195 










200 


216 


Leu 


He 


Gly Arg 


Leu 


Pro 


Leu 


Met 


217 




210 










215 




218 


He 


Glu 


Asp 


Lys 


Asn 


He 


Leu 


Leu 


219 


225 










230 






220 


Gly Tyr Arg Asp 


He 


Ala 


Gly 


Ara 


221 










245 








222 


Val 


He 


Asp 


Glu 


Leu 


Asn 


Ser 


Glu 


223 








260 










224 


Ser Gly Arg Glu 


Leu 


Tyr 


Leu 


Arg 


225 






275 










280 


226 


Leu 


Arg 


He 


Trp 


Arg 


Glu 


Asp 


Glu 


227 




290 










295 




228 


Tyr 


Asn 


Met 


Arg 


Gly 


Glu 


Leu 


Ala 


229 


305 










310 






230 


Arg Gly Trp 


Pro 


Leu 


Pro 


Glu 


Arg 


231 










325 








232 


Tyr Asn Asp 


Trp 


Arg 


Gly 


Asn 


Gly 



233 340 

234 

235 

236 

237 



INPUT SET: S33828.raw 



Leu 


Phe 


Ser 


Ala 
140 


His 


Leu 


Asn 


Ser 


Pro 


His 


Leu 
155 


He 


Lys 


Ala 


Gin 


Arg 
160 


Tyr 


Leu 
170 


Leu 


Leu 


Arg 


Glu 


Leu 
175 


Arg 


Gly 


Lvs 


Val 


Thr 


Leu 


Lvs 


Val 


Lvs 


185 










190 






Val 


Ala 


Val 


Asn 


Thr 
205 


Ala 


Val 


Met 


Asn 


Pro 


Lys 


Lys 
220 


Val 


Ala 


Ser 


Trp 


Tyr 


Gly 


Thr 
235 


Asp 


He 


Glu 


Phe 


He 
240 


Met 


Ser 


Val 


Glu 


Gly 


Leu 


Leu 

£t -J -5 


Glu 


Leu 


Cys 


Pro 


Ser 


Glu 


Leu 


Lys 


His 


265 










270 






Thr 


Ser 


Ser 


Trp 


Ala 
285 


Asp 


Lys 


Ser 


Gly 


Asn 


Ala 


Arg 
300 


Leu 


Asn 


Met 


Leu 


Phe 


Leu 


Ala 
315 


Glu 


Asn 


Ser 


Asp 


Ala 
320 


Arg 


Leu 
330 


Asp 


Ala 


Phe 


Arg 


Ala 
335 


He 



Glu Pro 



345 
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SEQUENCE VERIFICATION REPORT 

PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:53 



INPUT SET: S33828.raw 



Line Error Original Text 



SEQUENCE MISSING ITEM REPORT 
PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:53 



INPUT SET: S33828.raw 



< < THERE ARE NO ITEMS MISSING > > 
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SEQUENCE CORRECTION REPORT 

PATENT APPLICATION US/09/407,806 



DATE: 11/03/1999 
TIME: 13:58:53 



INPUT SET: S3 3 82 8. raw 



Line Original Text Corrected Text 

3 (1) General Information (1) GENERAL INFORMATION: 

8 (ii) TITLE OF THE INVENTION: ALPHA-GALACTOSID (ii) TITLE OF INVENTION: ALPHA-GALACTOSIDASE 



